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Abstract. The 2-dimcnsional Hamming graph H{2,n) consists of the n vertices 1 < 

' J < two vertices being adjacent when they share a common coordinate. We examine random 

. subgraphs of H(2, n) in percolation with edge probabihty p, in such a way that the average degree 

04 ' satisfies 2(n — \)p = 1 + e. Previous work has shown that in the barely supercritical region 

^ ■ n^'^/^hi'^'^ n ^ e <C 1, the largest component satisfies a law of large numbers with mean 2£n. 

, ^ \ Here we show that the second largest component has, with high probability, size bounded by 

2^e~^ log(n^e'^), so that the dominant component has emerged. This result also suggests that 
a discrete duality principle holds, where, after removing the largest connected component in the 
supercritical regime, the remaining random subgraphs behave as in the subcritical regime. 



in 



o 



1. Introduction and main result 



In their seminal work [6], Paul Erdos and Alfred Renyi noted with surprise the development of 
a giant component in the random graph G{n, p) where each of the n{n — l)/2 possible edges of the 
■ complete graph of size n is present with probability p independently of all the other edges. When 
the average degree [n — l)p satisfies {n — l)p = 1 + e, and e is positive and fixed (independent of 
Q I n), then the largest component will contain a positive proportion of the vertices while the size of 
^ ■ the second largest component is only logarithmic in n. Today we see this as a phase transition 
phenomenon exhibiting what mathematical physicists call 'mean-field' behaviour. 

For many years, there has been great interest in the barely supercritical phase of G{n,p), that 
00 ', is, the range of p values given hj p = (1 + e)/n, where e = e{n) satisfies n~^^^ <^ e = o(l). For 
O ■ convenience, we can also write e = Xn~^^^, where A = X{n) — > +oo, but does so more slowly 
^ ■ than n^^^. In this phase the dominant component has already appeared. We actually know quite 
^ , precisely that the largest component, C(i), satisfies |C(i)| = 2en{l + Op(l)) = 2An^/^(l + Op(l)) 
^ " with probability tending to 1 as n ^ cxd, where Op(l) denotes a quantity that converges to zero 
- - ' in probability; and that the second largest component, C(2), satisfies |C(2)| = 6(£:~^ log(n£:^)) = 
9(?7,^/'^A~^ In A) with probability tending to 1 as n — > oo. Thus, in particular, \C^2)\ ^ n"^^^ "C |C(i)|. 
Further, as A increases, the largest component increases in size while the second largest diminishes 
in size. (Actually, the second largest component is being frequently 'gobbled up' by the dominant 
component, generally leaving a smaller component as the new second component.) See [H [ID] for 
the proofs of these results, and [21 H] for introductions to the field. We feel, speaking quite generally, 
that an intensive study of the second largest component is vital to enhancing our understanding 
of percolation phenomena. 

We believe that the second largest component should grow until the random structure reaches 
a critical window. In that critical window, which, for G{n,p), means p = (1 + Xn~^^^)/n with A 
fixed, the first and second largest components exhibit complex chaotic behaviour. On the other 
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hand, in the barely supercritical phase, just after the critical window, the dominant, or 'giant', 
component will have asserted itself. 

In the present paper, our object of study is the 2-dimensional Hamming graph H{2,n). The 
vertices of this graph can be represented as ordered pairs 1 i, j ^ n. Vertices and 

are adjacent if and only if either i = i' or j = j'. Pictorially, H{2,n) consists of an n x 
lattice with each horizontal and vertical line being a complete graph. We write Q = 2{n — 1) for 
the vertex degree in H{2, n). We examine random subgraphs of H{2, n) in independent percolation 
with edge probability p; that is, each edge is kept with probability p and removed with probability 
1 — p, independently of all other edges. We set pc = 1/^, which will act as our critical probability; 
a justification for this definition of critical probability lies in the recent results in |3], HI [^. We 
parametrise p = {1 + e)/Q so that the average vertex degree is 1 + e. Throughout the rest of 
the paper C(i),C(2) will refer to the largest and second largest components, respectively, of the 
Hamming graph H{2,n). Also, we shall use the phrase 'with high probability' (whp) to mean 
'with probability 1 — o(l) as n oo\ 

This work continues the exploration of van der Hofstad and Luczak [7j. It was shown therein 
that, when n~'^^^{\ogny^^ <^ e <^ 1, the largest component has size 2en?{l + Op(l)). The general 
sense of a mean-field percolation event in percolation on a graph with V vertices is that there 
is a critical probability Pc, and that the barely supercritical phase occurs when p = pc{l + e) 
and V~^^^ <C e <C 1. This is the case in the Erdos-Renyi phase transition with V = n. For 
the H{2,n) phase transition, V = n^, and so the above results, up to a logarithmic term, fit 
the mean-field paradigm. Here we study the second largest component in percolation on H{2, n) 
in the barely supercritical region. In this aspect we are also able to, again up to a logarithmic 
term, fit the mean-field paradigm. In the mean-field picture of random graphs, the structure 
remaining when the dominant component is removed for p = Pc{l + e) where e = e{n) — is 
like the largest connected component in the subcritical regime with p = pc{l — e). It is well 
known that in this regime in G{n,p), the second largest component is of order e~^log{ne^). The 
upper bound is the content of our main result. In our results for H{2, n), we shall always work at 
p = Pc + e/fl = pc + e/ {2{n — 1)). In[71 (1.10) and (1.11)], it is shown that there is little difference 
in working with p = Pc + e/Q or p = (1 + e)/Q, and we refer the reader there for more details. 

Theorem 1.1 (The second component in the supercritical phase for i/(2,n)). Consider the 2- 
dimensional Hamming graph H{2,n). Let p = Pc + and let n~^/'^(logn)^/'^ <^ e -C 1. Then, 
whp, 

IC't.)! <2V2log(nV). (1.1) 

In particular, this result implies that the ratio of the sizes of the second and first largest com- 
ponents tends to zero in this regime, a salient feature of the barely supercritical phase. We feel 
that this feature should hold even without the logarithmic separation from criticality. That is, 
parametrise 2(n — l)p = 1+e and assume only n"^/^ <C e <C 1. We conjecture, following [7], that 
the largest component will have size 2£:?t,^(1 -|-Op(l)). We further conjecture that the second largest 
component will have size -C n^/^, which in particular is asymptotically smaller than the largest 
component. Let us note at this point that the logarithmic gap from the critical window (defined as 
in [HHHIS]) has recently been removed by Asaf Nachmias [H]; however, he does not establish a law 
of large numbers for the giant component, and he does not consider the second largest component. 
Further, we conjecture that, when n~^^^ -C e <C 1, |C(2)| = 6(£:~^ log(n^e^)), i.e., the bound in 
Theorem 11.11 is sharp. Thus, in particular, we conjecture that the barely supercritical regime for 
H{n, 2) has similar behaviour to that of G{n,p). 
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2. Preliminaries 

In this section, we establish a lemma for a class of branching processes that will play a key role 
in our proofs. 

We start with an inequality concerning deviations of binomial random variables below their 
mean. If X ~ Bi{k,p), then (see for instance [H]) 

P(X <kp-t)<e (2.1) 

We consider Galton- Watson processes where each individual's offspring is a random variable Z 
such that E[Z^] < oo. We always assume that our process begins with one individual. Sometimes 
we shall take Z to have a binomial distribution Bi{N,p), with p the Hamming graph edge probabil- 
ity, and a suitable positive integer. We will write fN,p for the probability measure corresponding 
to this process. We will also need Galton- Watson processes that are 'inhomogeneous', in that the 
offspring size may vary depending on the parent's 'location' in the Galton- Watson tree. 

A Galton- Watson process can be thought of as a 2-dimensional Markov chain {Qt, Gt), where Qt 
is the total progeny born until time t, and Gt is the total number of 'active' population members, 
that is those that are yet to have offspring. To be precise, we think of a Galton- Watson process 
as an evolving tree that is explored one node at a time; then Qt is the total number of nodes 
in that tree at time t, and Gt is the total number of unexplored nodes at time t. At each time 
t, if Gt > 0, then we choose one active member of the population and decide the number of 
its offspring. In a homogeneous Galton- Watson process, all population members have the same 
offspring distribution, in our case binomial distribution Bi{N,p), where G N and p G [0, 1] is 
the Hamming graph edge probability. Then, on the event {u : Gt{uj) > 1}, 

Qt+i = Qt + Zt, Gt+i = Gt + Zt ~ 1, 

where the Zt are i.i.d. Bi{N,p). We always assume Qo = Gq = 1; we let ipQ = and 

ipt = o-{Zs : s <t), t = l,2,..., 

the (T-field generated hj Zs {s = 1,2, ... ,t). 
Note that, by the above, 

lG,.,>iQt = Ig,-i>i(1 + Zo + ... + Zt^i), (2.4) 
lG,_i>iGt = Ig,-i>i(1 + ZQ + ... + Zt-i~ t), (2.5) 
where I_4 denotes the indicator of the event A. Letting Tq = inf{t : = 0}, we further see that 

Zo + . . . + Zt,-i = To - 1. (2.6) 

For convenience, we shall instead assume that our Galton- Watson processes have a continuation 
in that the random variables Zt continue to be generated even after Gt has hit 0. With this 
assumption, we may simply write, for each t, 

Qt = l + Za + ... + Zt-u Gt = l + Z^ + ... + Zt.i-t. (2.7) 

We shall now generalise this setting to a branching process where Zt are not i.i.d., but where 
each Zt is Binomial Bi{Nt,p). Here, p is the edge probability of the Hamming graph H{2,n), and 
each Nt is a random variable independent of Zt and such that, for each t, and conditionally on 

Ntp G [1 + e/2, l + e] with probability 1. (2.8) 



(2.2) 
(2.3) 
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Any such branching process will be called generically a narrow-banded (discrete) birth process, and 
in the lemma below we use P to denote the corresponding probability measure - with a slight 
abuse of notation, since in fact there is a whole class of processes satisfying property (I2.8p . 

Lemma 2.1 (Large finite progeny for narrow-banded processes). Let Q be the total population 
size of a narrow- banded birth process as defined above. Then, for a > sufficiently large, there 
exists a constant C such that 

¥{ae-^ <Q<oo)< Cee-"/^'. 
Proof. As described above, at each time t we generate Zt, a binomial Bi(iV(,p), where 

P = ^In^Ji^ ^ [1 + 1 + ^] I "Pt-i) = 1- (2-9) 

As earlier in this section, let Gq = 1, and let Gt = 1 + Zq + . . . + Zf^i — t for t = 1, 2, . . .. In other 
words, we assume our narrow-banded Galton- Watson process to have a continuation; clearly, this 
does not in any way affect the correctness of the proof of the lemma. 

Let 

S = \Gy^,-2/2^<ae~^/lQ; > for alH < ae~V2}, (2.10) 

^ = {^^-2/2] > "^^"Vl6; > for alH < ae-V2}. (2.11) 
Let To be the first time t such that Gt = 0. Then To < as~'^/2, and G^o-i = 1, so it follows that 

Q = Qn~i = Gto-i + To - 1 < ae~V2 < ae"''. 
Therefore, we can upper bound 

F{ae-^ <Q <oo) = P({ae~^ < Q < oo} n ^'^) + F{{ae~'^ < Q < oo} n S) 

< P({a£^2 < g < oo} n ^) + P(^) 

= P(ae"^ < g < oo I ^)P(^) + ¥{8), (2.12) 

since, if g > ae"^, then G^ > for all t < ae~'^/2. We shall start by bounding P(a5~^ < g < oo | 
S), and later bound F{S) and P(^). 

On the event S, for t > ae~'^/2, we couple our narrow-banded birth process with the standard 
Galton- Watson process where the offspring distribution is binomial 



Bi 



2(n-l)(l + e/2) 



' 2(72 



^ ' ^ ' (2.13) 



l + e 

with mean less than or equal to 1 + e/2. Let and G^ respectively be the total progeny and 
total number of active members at time t for this process; assume also that Q~\^ae-'2/2i ~ \<^£~^/'^Q]- 
Let to = [ae"^/2j , and for t > to + 1 let us write 

Q~ = \aE^' /16] +Z;^ + ... + Zr, Gr = [«e"Vl6l + Z'^ + . . . + Z; - {t - to), (2.14) 

where the are i.i.d. Bi ([(2(n - 1)(1 + e/2)/{l + e)J, (1 + e)/2{n - 1)). Let P" denote the 
probability measure corresponding to this process. 

The coupling is between the corresponding tree exploration processes, step-by-step, as is stan- 
dard (and as used, for instance, in [3, Section 4]), so that Zt > Z^ , Qt > and Gt > GjT for all 
t > [ae-'^/2\. This implies that 

P(supgt < oo I ^) < P"(supg7 < oo). (2.15) 

t 
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Let Qfc denote the law of a branching process with offspring distribution in fl2.13p . and starting 
from k individuals, and let Q~ be its total progeny. Then, 

p-(supgr <oo)= Qr„.-i/«i(Q" < oo) = Qi(Q^ < cx))^-"/^*'^ (2.16) 

by the independence of the evolution of the initial individuals. 
Further, [H (3.28)] shows that 

Qi{Q~ <oo) = l-e + 0{n~^ + e^), (2.17) 

so that 

p-(supg- < oo) = (1 - e + 0(n-i + ^s^^r^^-Viei < e"«/i6(i + o{a{e + (en)"^))), (2.18) 
for all e G (0, 1) and a > 0. We conclude from the above thet 

P(supQt < oo I ^) < e-"/^6(l + o(l)). 

i 

Next we shall show that F{S) and P(£') are quite small. For this, we also need an upper bounding 
Galton- Watson process {Q'^,Gf), where the offspring distribution is binomial 

^■(2("-i)-5(!ri))- 

with mean 1 + e. Let Qf and Gf respectively be the total progeny and total number of active 
members at time t for this process; assume also that the initial population size is Qq = 1. Let us 
write 

Qt = l + Z+ + ... + Z+„ Gt = l + Z^ + ... + Zl, - 1, (2.19) 

where the Z^ are i.i.d. Bi(2(n — 1), (1 + e)/2{n — 1)). We use P'^ to denote the corresponding 
probability measure. 

Similarly, we use a lower bounding Galton- Watson process {Q^,G^) (but this time starting 
from time 0, rather than from time to = L'^^"^/2j)) where the offspring distribution is binomial 
Bi (L2(n - 1)(1 + e/2)/(l +£:)J, (1 +£:)/2(n - 1)), with mean at most 1 +£:/2. Let and G^" be 
the total progeny and total number of active members at time t for this process; assume also that 
the initial population size is Qq = 1. Let us write 

Q; = 1 + Z,+... + Z-_„ G; = 1 + Z,+... + Z-_, - 1, (2.20) 

where the Zr are i.i.d. Bi ( [2(n - 1)(1 + e/2)/{l + e)\ , (1 + e)/2{n - 1)). Once again, we use P 
to denote the corresponding probability measure. 

Now we couple Zt with Zf and Z^ , so that, for alH = 0, 1, . . ., 

Z^ < Zt< Z+. (2.21) 

A suitable coupling can be achieved thanks to standard results about stochastic domination be- 
tween binomial random variables with different parameters. Explicitly, for n large enough, we may 
generate independent binomial random variables Zf such that 

Z- ~ Bi ( L2(n - 1)(1 + e/2)/(l + e)\ , (1 + £)/2(n - 1)) , 

and independent binomial random variables Wt such that 

Wt ~ Bi (2{n - 1) - [2{n - 1)(1 + e/2)/{l + e)\ , (1 + e)/2{n - 1)) . 

We can then set Z^ = Z^ + Wt for all t. We let P+'~ denote the coupling measure. 
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Let A be the event that > for all t < Iq. Let B be the event that G^^ < ae~^/16. Note 
that, under the couphng, 

SCAnB. (2.22) 

However, now it is easily seen (using [71 Proposition 3.2]) that there exists Cq = Co{a) such that 
Co — * as a — ^ oo and 

¥p~{A) = Pp(g > to) = 2E + 0{e^ + l/VTo) < (2 + co{a))e{l + o(l)). (2.23) 

This follows since to = [tt£^~^/2j. Also, using (12. ip . 

P+'~(i3) < Pp^Bi {[2{n - 1)(1 + e/2)/{l + e)\ [ae"V2j, (1 + e)/2{n - 1)) < ae-y2 + 

<exp(-a/2^), (2.24) 

for all e satisfying e ^ n~^/'^(logn)^/^, and all a and n sufficiently large, since ae^^ ^ 1 and 
ae^^ 3> ae~'^/n. 

Now, the event A is increasing, and the event B is decreasing, and both are events on the same 
probability space, corresponding to a family of independent random variables. It then follows from 
the FKG inequality that they are negatively correlated. Hence, 

Pp(^) < P+'-(^ nB)< Fp-{A)Fp-iB) < (2 + co(«))5e-°/2'(l + o(l)). (2.25) 

Also, 

P(^)<P+'-(^), 

and hence 

Fp{ae-^ <Q <oo)< F{{as'^ < Q < oo} S) + F{S) < 2(2 + co{a))ee-''^^\l + o(l)). (2.26) 



3. Proof of main result 

Recall that Q = 2{n — 1). Let Q(v) denote the component of vertex v. Our first lemma is [71 
Proposition 2.1]. 

Lemma 3.1 (Cluster tail equals the survival probability). Let e satisfy e^n^ ^ logra. Let p = 
Pc + e/Q. Let N ^ Then, for any vertex vo = {io,jo), 

Pp(|g(vo)| > at) =26(1 + 0(1)). (3.1) 

Our next lemma upper bounds the variance of Z>jv, the number of vertices in components at 
least A^. This result is a special case of [71 Corollary 2.3]. 

Lemma 3.2 (Concentration of vertices in large clusters). Let e satisfy e^n^ ^ logn. Let p = 
Pc + e/Q. Let N ^ e'"^. Then, for every 6 > 0, 

Pp(|Z>^ - Ep[Z>^]| > 6en^) = o(l). (3.2) 

We now show that, whp, there are no components of 'medium' size; that is, if a = a{n) oo 
as n — ^ oo, then any component of size at least ae"^, whp will in fact be of size at least en'^/5. 
This is the content of our next lemma: 
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Lemma 3.3 (No middle ground). Let e satisfy e^m?' ^ log n. Letp = Pc+s/Q. Let a = a{n) oo 
as n ^ oo. Then there exists a constant C such that, for n large enough, 

^p[ae'' < |Q(vo)| < ^) < C{ee~''/^" + n-^). (3.3) 

Hence, the probability that there is some vertex vq such that its component Q(vo) satisfies 

2h~^ \og{n^e^) < IQ(vo) | < en" /h (3.4) 

is o(l) as n —> oo. 

Before giving the proof of Lemma [3T3l let us state two more results from which compare the 
size of the cluster of a vertex to the total progeny of suitable Galton- Watson processes. 

The first of these is essentially [3, Lemma 4.1], proved by standard methods, and gives an upper 
bound. Let p be the probability measure corresponding to a standard Galton- Watson process 
where the family size is a binomial with parameters VL and p, and the initial population size is 1. 

Lemma 3.4 (Stochastic domination of cluster size by branching process progeny size). For every 

Pp(|Q(vo)| >£) <Pn,p(g>^). 

The second one is a slight extension of |7l Lemma 4.3], and establishes a lower bound. Let 
fi' = r2 — I max{£n~^, Clogn} and note that VL' > 2(n — 1) — ^en for n sufficiently large. It turns 
out that the cluster size can be stochastically bounded from below using a Galton- Watson process 
where the family size is a binomial with parameters Q' and p, and the initial population size is 1. 
For n sufficiently large, this process is supercritical, with mean population size at least 1 + e/2, 
since Q' > 2{n — 1) — ^en. 

Lemma 3.5 (Stochastic domination of cluster size over branching process progeny size). There is 
a constant C > such that the following holds. For every I < en^/b, 

Pp(|Q(vo)| > > ^n'AQ >() + 0(n-^), (3.5) 
where f2' = f2 — | max{£?7,~^. Clog n} > 2{n — 1) — ^en. 

Lemma 13.51 can be proved in exactly the same way as [3, Lemma 4.3], using an extension of 
[TJ Proposition 4.4] concerning the number of elements per line in large clusters from 77 -C e to 
?7 < e/5 (which is exactly the same, again, since the proof of [3, Proposition 4.4] does not in any 
way rely on 77 being of a smaller order than e). 

In fact. Lemmas 13.41 and 13.51 are not sufficient for our purposes, and we refine them in the 
following. Let Qt(vo), Gt(vo) denote the total number of vertices and the number of unexplored 
vertices at time t in the exploration of the cluster of vertex vq. Also, Qt and Gt, respectively, will 
denote the total number of nodes and the number of unexplored nodes at time t in the Galton- 
Watson tree when the offspring is binomial Bi(r2,p); and let Q[ and GJ, respectively, denote the 
total number of nodes and the number of unexplored nodes at time t in the Galton- Watson tree 
when the offspring is binomial Bi(f2',p). Let £t be the event that, for every z, no more than m 
vertices (i,a;) and no more than m vertices (x, i) have been included in the cluster of a vertex vq 
up to time t during its exploration process. Also, let £[ be the event that Q'^ < Qtiyo) < Qt and 
G[ < Gi(vo) < Gt. 

Lemma 3.6 (Sandwiching the cluster exploration). Let i = en'^/5, and let m = 5in^^/2. Then, 
if n is large enough, on the event St fl Sj-, there exists a coupling Pj^^^'.p of the cluster exploration 
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process and the upper and lower bounding Galton-Watson processes ^n^p,'Pn',p in such a way that, 
fn,n',p-almost surely, Q[^^ < Qt+i(yo) < Qt+i and G^_^^ < G't+i(vo) < Gt+i- 

It is easy to prove Lemma 13.61 using standard component exploration and coupling methods, in 
a similar way to [3, Lemmas 4.1 and 4.3], and so we omit the details. We are now ready to prove 
that there is indeed no middle ground: 

Proof of Lemma \3.3\ Lemma 13.61 implies that, on the event St fl S^., the (t + 1)**^ step of the 
exploration process of the cluster of vertex vq can be coupled with the (t + 1)**^ step of a narrow- 
banded process in Lemma [231 Since [2(n — 1) — ^en]p > 1 + e/2 for n large enough, the family 
size of the narrow-banded branching process in question (i.e., the component exploration process) 
falls into the interval [1 + e/2, 1 -|- e], as required. Now observe that Q(vo) > £ if and only if 
^^(vo) > i- We use this fact, first with £ = 2®e~^ log(n^£:"^), and then with i = e^n/b. Then, the 
first claim follows directly from Lemma [2.11 also noting that IPp(i^f) = 0(n~^) for all t < eri^/b, 
see O Proposition 4.4 and its proof]. 

As for the second claim, note that for every x, y, the number of components of size in between 
X and where < a; < is given by 



where, for the second inequality, we have used (13. 3p . Summing over all vertices v, we see that 
E[A^a:,s/] = o(l), and hence ¥p{Nx^y > 1) = o(l), as required. 

I 

We now complete the proof of Theorem ll.lt 

Proof of Theorem \l.l\ By Lemmas 13. Iti3. 21 Z-^^^e-'^iosin^e^)^ the number of vertices in components of 
size at least 2^e~'^\og{n'^e^), is concentrated around 2£:n^. In other words, the number of vertices 
in connected components of size at least 2^e~^ log(n^£^) is close to 2en^ whp. Now, from [7], we 
know that, whp, there is a giant component of size 2en^(l + Op(l)). This implies that, whp, 
there is no other cluster of size at least ev? j^. Further, by (13. 4p in Lemma [3. 3[ whp, there are no 
components of size at least 2^e~'^\o<g{v?e^^ and less than en^/b. Hence, whp, the second largest 
component must be at most 2^ e'"^ Xogin^ e^) , as claimed. I 
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(3.6) 



Let X 



2^e ^log(n^e^) and y = en'^/5. Then, for any vertex v. 




(3.7) 
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